Amendments to the Claims: 

This listing of claims will replace all prior versions, and listings, of claims in the application: 

Listing of Claims: 

1 . (Currently amended) A method, in a data processing system, for parsing Eastern Asian language 
character streams, the method comprising: 

receiving a corpus of word-based parse trees , wherein each word-based parse tree in the corpus of 
word-based parse trees includes a word tag for each word in the word-based parse tree ; 

converting the corpus of word-based parse trees into a corpus of character-based parse trees,, 
wherein converting the coipus of word based parse trees includes assigning a word position tag to each 
character in the character-based parse tree based on the word tag for each word in the word-based parse 
tree : and 

training a character-based parser using the corpus of character-based parse trees, wherein the 
character-based parser is used at a character level, and wherein the character-based parser does not require 
a separate word-segmenter. 

2. -3. (Canceled) 

4. (Currently amended) The method of claim [[3]]1_, wherein the word position tag is one of a 
beginning tag, a middle tag, and an end tag. 

5. (Original) The method of claim 1, wherein training the parser includes forming a model. 

6. (Original) The method of claim 5, further comprising: 

providing the model to a decoder, wherein the decoder parses Eastern Asian language character 
streams at a character level using the model. 

7. (Original) The method of claim 6, further comprising: 

receiving a test sentence, wherein the test sentence is an Eastern Asian language character stream; 

and 

parsing the test sentence using the decoder to form one or more character-based parse trees. 
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8. (Original) The method of claim 1, wherein training the parser includes training the parser using 
maximum-entropy method. 

9. (Original) The method of claim 1, wherein the Eastern Asian language is one of Chinese, 
Japanese, and Korean. 

10. (Original) The method of claim 1, wherein the corpus of word-based parse trees is a Chinese 
Treebank. 

11. (Previously presented) An apparatus for parsing Eastern Asian language character streams, the 
apparatus comprising: 

means for receiving a coipus of word-based parse trees; 

means for converting the corpus of word-based parse trees into a corpus of character-based parse 
trees; and 

means for training a character-based parser using the corpus of character-based parse trees, 
wherein the character-based parser is used at a character level, and wherein the character-based parser 
does not require a separate word-segmenter. 

12. (Currently amended) A recordable-type medium encoding a computer program product, in a 
computer readable medium, for parsing Eastern Asian language character streams, the computer program 
product comprising: 

instructions for receiving a corpus of word-based parse trees , wherein each word-based parse tree 
in the corpus of word-based parse trees includes a word tag for each word in the word-based parse tree ; 

instructions for converting the corpus of word-based parse trees into a corpus of character-based 
parse trees , wherein the instructions for converting the corpus of word based parse trees includes 
instructions for assigning a word position tag to each character in the character-based parse tree based on 
the word tag for each word in the word-based parse tree ; and 

instructions for training a character-based parser using the coipus of character-based parse trees, 
wherein the character-based parser is used at a character level, and wherein the character-based parser 
does not require a separate word-segmenter. 

13. -14. (Canceled) 
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15. (Currently amended) The recordable-type medium encoding the computer program product of 
claim [[14]] 12, wherein the word position tag is one of a beginning tag, a middle tag, and an end tag. 

16. (Previously presented) The recordable-type medium encoding the computer program product of 
claim 12, wherein the instructions for training the parser includes instructions for forming a model. 

17. (Previously presented) The recordable-type medium encoding the computer program product of 
claim 16, further comprising: 

instructions for providing the model to a decoder, wherein the decoder parses Eastern Asian 
language character streams at a character level using the model. 

18. (Previously presented) The recordable-type medium encoding the computer program product of 
claim 17, further comprising: 

instructions for receiving an input sentence, wherein the input sentence is an Eastern Asian 
language character stream; and 

instructions for parsing the input sentence using the decoder to form one or more character-based 
parse trees. 

19. (Previously presented) The recordable-type medium encoding the computer program product of 
claim 12, wherein the instructions for training the parser includes instructions for training the parser using 
maximum-entropy method. 

20. (Previously presented) The recordable-type medium encoding the computer program product of 
claim 12, wherein the Eastern Asian language is one of Chinese, Japanese, and Korean. 
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